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Abstract 

The happy function H : N —?' N sends a positive integer to the sum of the 
squares of its digits. A number x is said to be happy if the sequence {i/"(a;)}^i 
eventuahy reaches one. A basic open question regarding happy numbers is what 
bounds on the density can be proved. This paper uses probabihstic methods to 
reduce this problem to experimentally finding suitably large intervals containing 
a high (or low) density of happy numbers as a subset. Specifically we show that 
d > .18577 and d < .1138. We also prove that the asymptotic density does not 
exist for several generalizations of happy numbers. 
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1. Introduction 

It is well known that if you iterate the process of sending a positive integer 
to the sums of the squares of its digits, you eventually either arrive at 1 or the 
cycle 

4 ^ 16 ^ 37 ^ 58 ^ 89 ^ 145 ^ 42 ^ 20 ^ 4 

If we change the map, instead sending an integer to the sum of the cubes of 
its digits, then there are 9 different cycles that are possible (see Q). There are 
many possible generalizations of these kinds of maps, for instance Grundman 
and Teeple in considered the map which sends an integer n to the sum of 
the e'th power of its base b digits. When appropriate we adopt their notation 
of (e,6)-happy numbers, which are numbers eventually arriving at 1 under this 
map. The results of this paper, however, apply to a broader class of functions. 

Definition 1.1. Let b > 1 be an integer, and let h be a sequence of b ~ I non- 
negative integers s.t. h(0) = 0,h{l) — 1. We define the generalized b-happy 
function with digit sequence h as the extension H : Z+ — > Z+ where for n G Z+ 

k k 

with base b representation n — atb^ , H{n) = ^ h(ai). 

i=0 i=o 
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Fix a generalized &-happy function H. Let a = max If 71 is a d 

digit integer (in base b), then 

H{n) < ad 

If d* is the smaUest c? e N s.t. ad < b"^^^, then for all n with at least d* digits, 
H{n) < n. This implies the following 

Fact 1.2. Vn e n,3i s.t. W{n) G [Oy'-'^ - 1]. 

Moreover finding all the possible cycles for a generalized 6-happy function 
H amounts to performing a computer search on the trajectories of the integers 
in the interval [0, 6"^ ~^ — 1]. 

Guy asks in [6, problem E34] a number of questions regarding happy numbers 
and their generalizations, including the existence (or not) of arbitrarily long 
sequences of consecutive happy numbers and what bounds on the density of 
integers arriving at various cycles can be proved. To date, there have been a 
number of papers in the literature addressing the former question, for example 
In regards to the latter question the main result of this paper will 
allow one to prove (using computer calculations) a lower (upper) bound on the 
upper (lower) densities for any choice of H as in definition (|l.ll) , and any cycle. 

Our results only give one sided bounds. In an earlier version of this manuscript 
(posted on the arxiv), we asked, in the case of (2, 10)-happy numbers, if d < 1. 
Recently Moews Q has announced a proof of this, and along the way gives an 
alternate proof of the fact that the density does not exist. Specifically, he shows 
that .1962 < J < .38, and 0.002937 <d< .1217. 

2. Basic Definitions 

Remark 2.1. Unless otherwise noted, we regard an interval I — [a, 6] as the 
integer interval {n G \ a < n < fe}) where in general a, 6 G M. As usual we 
denote \I\ to be the cardinality of this set. We denote [n] as the set {0, 1, . . . , n}. 

Definition 2.2. Let I be an integer interval and Y the random variable uni- 
formly distributed amongst the set of integers C /. Then we say the random 
variable Y is induced by the interval I. 

Definition 2.3. Let H be a generalized b-happy function. Let C C N. Lf for an 

integer n, 3k s.t. H^{n) G C , then we say n is type-C . 

Definition 2.4. We say an integer interval I has type-C density d if 

\{n ^ I s.t. n is type-C}\ 

\T\ = 

Remark 2.5. IfY is the random variable induced by the interval L then the 
type-C density of L equals P{H{Y) is type-C). 

Usually the set C will be assumed to be a cycle we are interested in. However 
in the case we wish to prove bounds on the lower densities, we instead find the 
upper density of the set of type-C" integers, where C is taken as the union of 
all the possible cycles except for C. 
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2.1. The random variable H{Y„i) 

Consider the random variable Ym induced by the interval [0, 6™ — 1], i.e. Ym, 
is a random m-digit number. If Xi is the random variable corresponding to the 
coefficient of V in the base-6 expansion of Ym, then 

m— 1 

= H{X,) (1) 

1=0 

In this paper we will be interested in the mean and variance of HiYi) (i.e. the 
image of a random digit) which we refer to as the digit mean (/i) and digit 
variance (cr^) of H . The H{Xi) in ([ij are all i.i.d random variables, thus 

E[iJ(y„)] = ^lm, Var[i/(y„0] = (T^m (2) 

([1} implies a number a facts about H{Ym). H{Ym) is equivalent to rolling 
TO times a 6-sided die with faces = 0, 1, H{2), . . . , H{b — 1) and taking the sum. 
It approaches a normal distribution as to gets large. Also the distribution of 
H{Ym) is concentrated near the mean which implies the following key insight: 

Remark 2.6. The density of happy numbers amongst m digit integers depends 
almost entirely on the distribution of happy numbers near fim. 

2.2. Computing Densities 

Let P™,, = F{H{Y^) = i). Then 

m 

{(ai,a2, . . . ,a„i) | Ok G H{[b - 1]) and ^ afc = i} 



For fixed to, the sequence {Pm,i) has generating function 

4=0 ^ ^ 

This implies the following recurrence relation with initial conditions Pq.o = 1, 
Po,i = 0,ieZ-{0}. 



_ Pm-lS + Pm-l.i-l + Pm-l,i-H{2) + ' ' ' + Pm-l,i-H(b-l) , .s 



To see this, write = ( i±^±,df!^^xf<!^^ ( 1+-+^"'^'+-+-"'^-^' ) 

and consider the coefficient of a;^ 

If a = max (H{i)), then H{Y,n) Q [0,77za], and in particular Pm,i — 

2=0,... ,6—1 ' 

if J > ma. Using this fact combined with (j4]) we can implement the following 
simple algorithm for quickly calculating the type-C density of the integer interval 
[6™ - 1]. 
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1. First using the recurrence calculate Pm,i for i — 0, ... , ma. 

2. Using brute force, find the type-C integers in the interval [0,ma]. 

3. Output J2 Pm,i- 

[0,ma] 
i type-C 

Using this algorithm, calculating the density for large n becomes computa- 
tionally feasible. The Figure 1 graphs the density of happy numbers < 10" for 
n up to 8000. 



Figure 1: Relative Density of Happy Numbers < 10' 
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The peak near 10**°" and valley near 10^'^^° will be used to imply the bounds 
obtained in this paper. 

2.3. A Local Limit Law 

The random variable H{Ym) approaches a normal distribution as m becomes 
large. The following theorerrQ presented in [4, p. 593] gives a bound. 

Theorem 2.7. (Local limit law for sums). Let Xi, . . . , X„ he independent inte- 
ger valued variables with PGF B{z) , mean fi and variance where it is assumed 
that the Xi are supported on Z+. Assume that B{z) is analytic in some disc that 
contains the unit disc in its interior and that B{z) is aperiodic with B{0) ^ 0. 
Then the sum, 

Sn ■= Xi + X2 + • • • + Xn 

satisfies a local limit law of the Gaussian type: for t in any finite interval, one 
has 

¥{Sn = Uin + tay^\) = ^ (l + 0{n^)) 



^We quote a simpler version, with a minor typo corrected 
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Here aperiodic means that the gcd{j \ bj > 0,j > 0} = 1, where B{z) = 

oo 

J2 bjZ^ . In our case the probability generating function (PGF) of the H{Xi) is 
the polynomial 

P{x) = ^ 

Thus since we assume H{0) — 0,H{1) — 1 the above theorem applies for the 
sum H{Y„i)- As a consequence, for a fixed interval [— T, T], if z = jim + ta^/m 
for some t € [~T, T] then 

-t£ 

P™,, = tl- (l + O (m^)) 
V27rm(T V V 

The constant for the error term 0{m~^) is dependent on the choice of T, 
for our purposes we do not need an explicit constant. 

2.4- A Heuristic 

Remark 2.8. (General motivation for the proofs) Suppose I is a large interval 
containing type-C integers with some density d. Consider the choices of m s.t. 
the mean of H(Ym) is in the interval I , then for some choices of m we likely 
have 

¥{H{Ym) is happy) > d 

This of course is just a heuristic, and we will only be able rigorously prove 
the weaker result, 3m s.t. F(^H{Yjn) is happy) > d(l — o(l)). The key idea for 
the proof is to average over all reasonable choices of m in order to imply there 
is an m with the desired property. 

Our main tool will be using theorem (|2.7p to say that for small fc, 
and H{Ym+k) have essentially the same distribution only shifted by a factor of 
Ilk. Thus as m varies, the distributions of the H(Ym) should uniformly cover 
(with small error) the interval /. This argument will not work immediately, as 
the fuzzy term in the local limit law prevents us from obtaining explicit bounds 
on the error (and any explicit bounds seem unsatisfactory for our purposes). 
Section 3 adds a necessary step, which is to construct an interval within [&"~^, 6"] 
with high type-C density for n arbitrarily large. The main result is presented 
in section 4, the proof uses the local limit law with the result from section 3. 

3. Constructing Intervals 

Definition 3.1. We say an integer interval I is n-strict if I ^ — 1], 

and \I\ — . 
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The primary goal of this section is to construct n-strict intervals of high 
type-C density for arbitrarily large n. 

Our choice of the definition of n-strict is only for the purpose of simplifying 
calculations, there is nothing special about the value |, in fact any ratio > ^ 
would work. Note that if 4 does not divide n, then no n-strict intervals exist. 

For the entirety of this section we will make the following assumptions: 

• _ff is a generalized 6-happy function with digit mean /z and digit variance 

• We are interested in the upper density of type-C integers for some C C N. 

• We have found via computer search an appropriate starting interval Ii 
which for some ni G N, is ni-strict and has type-C density = di. 

The results in this section only apply if this ni is sufficiently large, so we state 
here exactly how large ni must be so one knows where to look for the interval 
Ii. In particular we say an integer n satisfies bound (B) if 

Bl: a{i + 3/i + \/2cr&^) < 

B2: ^/iiiba < 

B3: ifi + + 2afi^b^^ < 6"-i 

For most of the interesting cases of 6-happy functions, for example the cases 
(e, b) s.t. e < 5, assuming n > 23 is enough to guarantee that it satisfies bound 
(B). This is well within the scope of the average computer, as it is possible to 
calculate the densities for n up to (and beyond) 1000 using the algorithm in 
section 2. These bounds are necessary in the proof of theorem p.Sp . 

Our first goal is to use an arbitrary n-strict interval / to construct a second 
interval I2 which is n2-strict for some n2 much larger than n and contains a 
similar density of type-C integers as /. The next lemma will be a helpful tool. 

Lemma 3.2. Let I — [^1,12], J = [ji,j2] be integer intervals. Let S* C / 
and Y a random variable taking integer values in J. For fc G Z denote Tk{Y) 
as the random variable Y + k. Then there exists a (possibly negative) integer 

ke[n-32,t2-.n] s.t. P{Tk{Y) e S) > jjj^jjj^- 

Proof. The idea of the proof is that by averaging over all appropriate fc, the 
distributions oiTk{Y) should uniformly cover /. More formally, let fci = ii —^'2, 
= *2 —j2, and let K be the set of integers in the interval [ii — j2, i2 —ji], which 
we denote as {ki, ki + 1, . . . , ^2}. Note that \K\ = \L\ + \ J\ — 1. Pick k uniformly 
at random from K, and consider the random variable Z = Pr(Tk{Y) G S*) . Then 

' ' k=ki 
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k—ki i^S 
k2 



-J2J2nrkiY)^^) (5) 



1^1 + 1^1 -^..S.^.. 
Note that V{Tk{Y) ^ i) = V{Y = i - k) . And for i e /, i - fci < jz, i - k2 > ji 

Thus yieS, V{-rk{Y) = i) ^ I. Therefore 

k—ki 



/i + 1^1-1 



So 3k s.t Pr(Tfc(r) e S) > E[Z] = iJillli_i proving the claim 



□ 



Using lemma p.2p we will not lose much density assuming that |/| is much 
larger than \J\. However when considering the random variable H{Ym) the set 
|J| will be much too large. As a result it will be more useful to consider a 
smaller interval where the bulk of the distribution lies. 

Lemma 3.3. Let Y be an integer valued random variable with mean /iy and 
variance Uy, and let A > 0. Let S be a set of integers C [11,12] — L with density 
d. Then 3k an integer e [ii — {fiy + 0'yA),i2 — {^y — cyA)] s.t. PrirkiY) S 

^)>(i-^)(-4^). 

^+ \i\ 

Proof. By Chebyshev's InequalitjH we have Pr{\Y — ^y\ < cyA) > 1 — jy. 
Let Y' be the random variable given by Y conditioned on being in the interval 
J = [^y-crYA,^y + cryA]nZ. Notc | J| < 2(Ty A + 1. Then bv lemma (|3?2|) there 
exists an integer fee [ii ~ {fJ-y + aY^),i2 — [p-Y ~ cry A)] s.t. f{Tk{Y') G 5) > 
= T^^Tx. Therefore we have P(rfc(y) e S) > P(y G J)P(rfe(y') G 

It is possible to construct sets of intervals which under the image of H 
act as shifts of each other. For example in base-10 (and assuming that -ff(O) = 
0, H{1) — 1) if the random variable Xi is induced by [1100, 1199] and A"2 induced 
by [0,99] then H(Xi) = H{X-2) + 2. 

We will now further expand on the example above. Consider n G N, s.t. 
4 I n. Let So ~ [0, &t — 1] and for k = 1, . . . , ^ consider the interval 



^We certainly could do better than Chebyshev's Inequality here. However, the bounds it 
gives will suit our purposes fine. 
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Then the Bk wih all be n-strict (with exception of Bq), and a random integer 
X S Bk will have the following base-6 expansion: 



X = 0(L^ XiXi^i . . . Xi 

k digits f-fc digits 3|. digits 

That is X will have its first k digits = 1, the next ^ — fc digits = 0, and the 
remaining ^ digits will be iid random variables Xi taking values 0, 1, 1 
with uniform distribution. 

Let Iq be the random variable induced by Bq, and Yfe be induced by Bk- 
Again because H{Q) = 0,H{1) = 1, 

H{Yk) = H{Yo) + k = Tk {H{Yo)) 



Recall H{Yo) has mean ^fj., and variance ^cr . Consider an interval / 
[ii, 12] containing a set of type-C integers S, and let A > 0. By lemma 



3k' e 



3n 



Act 



)--( 



3n 




s.t. 



Thus if / C 



(HiYo)) e S 



1 + + 



> 



-Ao- 



(6) 



then 1 < fc' < f . Setting 



k = k' produces the interval B^, which will be n-strict with type-C density 
> ([6]). In fact we have proven the following 



Theorem 3.4. 

1 + |n/i + Act 
density 
> 



Let n e N with 4 \ n, let C C 



7+ 



Given X > let Jn.\ 



Act 



1-^ 



n . Suppose 31 C J^.x with type-C 

d. Then there exists an n-strict interval I2 s.t. I2 has type-C density 
d 



Suppose we have found an rt-strict interval / with a desired type-C density 
d. We can then apply the previous theorem in order to construct a second 
interval I2 which is n2-strict for some n2 much larger than n assuming that 3n2 
s.t. 4 I n2 and / C Jn2,\- The question now becomes, how large must n be 
in order to guarantee the existence of n2 and A (A large enough as to not lose 
much density) s.t. / C Jn2,\- 

Such an 712 if it exists would be « (looking at the dominant term of 

the left endpoint of Jn2,x)^ which would give 



I J, 



712, X\ 



6" 
3^ 



Xab 2 



If this n2 is to work we'd surely need b ^ = \I\ < \ Jn2,\\- If ^ is small then it is 
quite easy to find such an n2, but we may in turn lose too much density. However 
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as n gets large we can find such an 7X2 while keeping A large. In an attempt 
to make calculations as simple as possible, the next theorem sets X = b'» (this 
is chosen for simplicity, not to optimize the result) and then finds how large 
n must be in order for an appropriate 712 to exist. The proof follows from a 
number of routine calculations and estimations, some of which we have left for 
the appendix. 

Theorem 3.5. Suppose I is n-strict with type-C density — d, where n satisfies 
the hound (B). Then 3n2 > ^ ^ and an n2-strict interval I2 s.t. the type-C 

density of l2>d{l-h^^ [l - "^^^ 



Proof. As before, let J„ 



We 



: 1 + + Act Y |to, + — Xa^ |to 

assumed that / is n-strict, so |/| = . Call the left endpoint of /, a, and write 
/ = [a, a + 63" — 1]. Setting A = 6t we attempt to find an 712 divisible by 4 s.t. 

I ^ Jn-j.x- It would be prudent to consider /(m) = 1 + + Xa^J^m which 
represents the left endpoint of Jm,\- We first find an integer n2 s.t. /(n.2) is 
close to yet still < a. By lemma (j6.2p in the appendix, assuming n satisfies (B) 
it follows that there exists 712 s.t. 

• 4 I 712 

• ^ < 712 < T-^" 

• < a - /(77 2) < 3^+ 1 

We now check that / C J^^ in order to invoke Theorem 13.41 We already 
have that the left endpoint /(n2) < a- So now check the right endpoints of I 
and J„2. We need 



1 + < !J + -n2^l - Act J -772 (7) 



The above is equivalent to 



/3 , /3 \ ,3r, 772 /3 
a — -7^211 + ActW -772 + 1+0«1 <— zActW -7l2 



Simplifying, the above follows from showing 



a - /(772) + feT + 2Act^/ ^772 < ^ 



Now letting 



Then 



LHS = a - /(772) + feS" + 2Act\/^772 



LHS < 3^7 + 1 + 53" + Xay/S^ 
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Using the fact that X = b» ,n2 < we get: 

LHS <3fi + l + 6^" + 2-^6* 

And letting RHS = ^ , by assumptions on n2 

6" 



So ([7]) follows from 



3/i + 1 + 65- + 2^6* < 



The above is exactly the bound (B3). Therefore / C J„2,a- Thus by applying 
Theorem p.4p with A = fet , 3 an 712-strict interval I2 s.t. 



type-C density of/2>(^l — t^^^^- 



d 



JTl 



Since n2 < it follows that, 



2(7 , 

> > 1 h— 



Thus we conclude the type-C density of /2 > d ^1 — ^1 — -^6~5-^ . □ 

Apply the previous theorem to our starting ni-strict interval Ii to get an 
n2-strict interval l2- Since n2 > ni, we can apply theorem p.Sp again on 12- 
Continuing in this fashion produces a sequence of integers {rii), and rti-strict 
intervals {Ii) each with type-C density di s.t. 

• n,+i > ^ 



• d^+i > d, [1 - b^j [1 - 



The second condition implies 

00 „ 
y^,d.>d^Y[[{l-b^)il-^b^)) (8) 



i=l 



Looking at the first condition, the sequence (n^) exhibits super exponential 
growth which clearly implies that Vi,ni > ini. The only worry would be when 
/Li is very large, but the bound (B) takes care of this case. 

Note for a:, a e E+, if a; > 2a > then 

1 - ax-i > r > 

- 1 + 2aa;-i " 
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Applying this first with a = l,x = b ^ and second with a = ,x~b»,we 
may now rewrite ([S]) as 

4 \ 
V -26^ -b^ I 

Using the fact that Wi^rii > mi, the sum in the previous inequality will become 
the sum of 2 geometric series, one with r = b^^,a = —2b~^^ and the second 
with r = b~^,a = ^^b~^ . RecaU that a geometric series with |r| < 1, and 

a 



_-4(T 

first term a sums to 



1 - r 



Therefore the first series sums to , the second sums to 

Finally after simplifying we conclude that 



Vi, > diexp 



-4cr 



Thus we have proven the following 

Theorem 3.6. Assume 3ni satisfying the hound (B), and an ni-strict interval 
Ii with type-C density = di. Then €N,3n> N, and n-strict interval I s.t. 

/ 2 4cr \ 

the type-C density of I > diexp 



4. Main Result 



As in the previous section we continue to assume H is & generalized b- 
happy function with digit mean and variance /x,cr^. Also we assume that we 
have experimentally found a suitable starting ni-strict interval Ii with type-C 
density di for some C C N. As in section 2, for positive integer m, let Ym be 
the random variable induced by the interval [0,6™ — 1]. 

In this section we give a proof of the following 

Theorem 4.1. Suppose Ii is ni-strict, where ni satisfies the bound (B) and Ii 
has type-C density = di. Then the upper density of the set of type-C integers is 

> di exp { —Krr + ) 

The digit mean and digit variance for the case (e, b) — (2, 10) is 28.5 and 
721.05 respectively. In this case, if n > 13 then it satisfies bound (B). After 
performing a computer search we find that the density of happy numbers in 
the interval [10^"^, 10''°'' - 1] is > .185773, thus there exists a 404-strict interval 
having at least this density of happy numbers as a subset (it is necessary here 
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that 4 I 404). Letting S{n) = (^jlJ + ^(T b% ) ) l'^(404)| < IQ-''^. 

For the lower density, the type-{l} density of [lO^^^^^ iQ^^eS-l] is < .11379. This 
implies that there is a 2368-strict interval with type-{4} density > 1 — .11379, 
apply the main result to conclude that the upper density of type-{4} integers is 
> 1 — .1138. This gives the following 

Corollary 4.2. Let d and d be the lower and upper density of {2,10)-happy 
numbers respectively. Then d < .1138, o? > .18577. 

In general, by taking C to be the union of all cycles other than C. We get 
as a corollary an upper bound on the lower density of type-C integers. 

The proof of theorem (14. 1|) is technical despite having a rather intuitive 
motivation. For the sake of clarity we first give a sketch of how to use theorem 
p.6p with theorem (|2.7|) in order to prove a lower bound on the upper density 
of type-C numbers. 

Given our starting ni-strict interval Ii apply the theorem p.6p to construct 
an 71-strict interval / (n large) with type-C density > (1 — o(l))di. Pick mi 
s.t. the mean of H{Ym-^) ~ umi lands in the interval /. Since / C [6"^^, 6"] we 
have mi — 0{h'^). This implies the standard deviation is 0(6^). This will be 
much less than |/| = h~r . 

Next, use lemma p.3p with a large A, to find a shift tx- (i? (y„jj)) which has 
probability > (1 — o(l))di of being type-C (note this fc < |/| = 6t). The mean 
of Tk[H{Ymi)) ~ firrii + k. Clearly there exists an integer m2 s.t. 

1^7712 — it^mi + k)\ < pi 

The means of H{Y„i^ ) and (iJ (Ymj ) are almost equal. Since k is much smaller 
than mi, 7712 we apply theorem (12. 7p to argue that within some (large) neigh- 
bourhood of the means, the distributions of H{Ym2) and rfc(if(l^J) are nearly 
identical. We conclude that V(^H{Ym2 ) is type-C) > (1 — o(l))di. Doing this for 
(arbitrarily) large n (and therefore m2) we find interval [0, 6™'^ — 1] with type-C 
density > di (1 — o(l)). This in turn shows that the upper density of type-C 
numbers is > c?i (1 — o(l)). 

4.1. Some Lemmas 

Making the above sketch fully rigorous requires the following e argument. 
We have broken some of the pieces down for 3 lemmas. The proofs primarily 
consist of calculations and we leave them for after the proof of the main result. 
We first make the following remarks about lemmas (14. 3p and (14. 5p . 

• It is not important how large the required N must be. 

• Lemma (|4.5p part (1) is the only place where theorem (|2.7I) is needed. 
Lemma 4.3. 3N sufficiently large s.t. Vn > A^, V/ n-strict, 3m G N s.t. 

[fim — am's , /im + urn s ] C / 
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Lemma 4.4. Let e > Q be given (assume as well that e < 1). Let A = 
Then 3N{e) s.t. if any n, mi, I satisfy: 

• n> N 

• I is n- strict 

Then the following hold: 

1. A < mi5 

2. |1- i^|<| 

J-H 3n 

b 4 

Lemma 4.5. Lei e > 6e (^iwen (assume as well that e<l). Let T = and 
A = Then 3A''(e) s.i. if any n,mi,m2,k,I satisfy: 

• n> N 

• |fc| < 6t 

• l/Ltmi + fc — /xm2| < /U 

• I is n-strict 
Then the following hold: 

1. Fori =1,2; max 11 - ^Kgl^^.O^L/^^.+toV^J) , < e 

2. 11 - ,/^| < I 

3. If3ti,t2 s.t. ti G [-f ) i"] and iimi+k + tiay^mi = Iim2+t20'y^m2- Then 

6- 



t2 e [-T,T] and |l-e^^^| < |. 



Proof of Theorem 4-1 
Proof. Let /(n) = |{i G Z+|i < n, Hs type-C}|. We must show that 

/(") ^ , 2 4cr \ 
limsup > aiexp ttt H 

It suffices to show Ve > 0,\/Ni e N,3m > A^i s.t. the interval [0,6™ - 1] has 
type-C density > diexp + ^^"^ (1 — e). Let e and A''i be arbitrary 

(assume only that e < 1). Set T = (we will be applying the local limit 
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law for the bounded interval [— T, T] later in the proof). Also in anticipation of 
applying lemma (|3.3p set A = 

First pick N > Ni large enough to apply lemmas (|4?3)) . (|44)) . and (|4?5|) . 
Invoke theorem (|3.6p to construct an n-strict interval I ioi n > N with type-C 
density d. Then 

For m e N let F,„ be the random variable induced by the interval [0, 6™ — 1], 
also let 

Jm — [fJ-m — <Tm» , /im + crms] 

Recall again that 

nH{Ym)] = Aim 
Var[if(y„0] = CT^m 

Hence J„i is where the "bulk" of the distribution of H{Ym) lands. Pick mi 
s.t. J„jj^ C / (the existence of which is guaranteed by lemma (|4.3p ) . Note that 

mi € [^7^' ^] since / is n-strict. Let S be the set of type-C integers in /. 
Apply lemma (|3.3p on H{Ym^) and A to find integer k s.t. 



Since Jm^ C / and |/| = 6t it follows that k < 6t. 



Recall the proof of lemma p.3p . In particular we ignored the tails of the 
distribution of H{Ymi) outside of Act^toi from the mean. Notice as well that 
A < mi 5 (by lemma (14.41) part 1). Calling 5' the set of type-C integers in the 
shifted interval Tk{Jmi), we may replace (jlOp by the stronger conclusion 



P(r,(i/(y„j) ^^)> d[l ^) (— 



E 



Using the assumption that A = y | and part 2 of lemma (|4.4I) we simplify 
the above as 

^P(r,(i/(y,„j) = i) >rf(l-i)2 (11) 
ieS' 

Now pick m2 G N s.t. jmi/i -f fc — m2/i| < /i. Since |fc| < , it follows that 

m2 € , ]■ In particular toi, m2, n, k, I now all satisfy the conditions of 

lemma (|4.5p . Let be the r.v. induced by the interval [0, b"^^ — 1]. It remains 
to show that near the mean of Tk[H{Xmi)) , the distributions of Tfe(i7(FmJ) 
and HiYm2) are similar, this will imply that the interval [0, — 1] contains a 
large amount of type-C integers. Making this precise, we prove the following 
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Claim 1. For integers i G Tk[Jmi) 

p(g(y,„j = ») 

P(rfc(i/(r™j) =z) " 6 
Proof. Wc let i G J„ij be fixed and pick ti,t2 s.t. 

i — /iTTii + fc + ticr^mi = fim2 + t2(7y/m2 
It is important now that we had chosen A = ^, this implies \t2 \ < T (see lemma 



(|4.5p part (3)). We can use the local limit law to estimate the distributions of 
Tk{H(Ym,)) and H{Ym,). By lemma part (1) 



6 2 e 



2 TT (7^/7712 6 



and 



27rcr->/mi 6 ' 



Hence 



P(rfc(i?(y™J) = 
Which by lemma (jTS]) parts (2), (3) 



— i i — > e 2 -^^ 2. 



>(i-i)* 



□ 



Let ^2 be the density of type-C numbers in the interval [0, 6™^ — 1]. Then 

»(i/(y„,j = z) 

ry^ii(i„,^) = '<■) = 2^ — 
ieS' ™ 



d2>Y.HH{yn.,) = ^) = Y. , , 



Which by claim 1 



>(l-ir^p(r,(iJ(F„J) 



And by equation (fTTj): 



Finally by equation ©: 



>d{l--f>d{l-e) 



6,2 > diexp 



This concludes the proof. 



4ct 



^(l-6Tr) 



□ 
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4-3. Proof of Lemma 4-2 

Proof. Let Jm — [^m — am's ^ + am's]. Then \Jm\ < 2ami +1. We assumed 
/ is n-strict, so / C 6" — 1], let a be the left endpoint of /. Picking m s.t. 

fj,m G I will give m = 0(6"), which in turn gives \ Jm\ — 0{b'^) <C |/| = . 
Comparing the growth rates of \Jm\ and |/| it is clear that we can pick iVi large 
enough s.t. n > Ni =^ 3m s.t. Jm'^ I. □ 

4.4. Proof of Lemma 4-3 

Proof. We find Ni , N2 for the 2 parts respectively, then choose N — max( A^i , -/V2) . 

1. A is a fixed constant here and it is assumed that toi > — , so the result 
is trivial (this gives iVi). 

2. For a; > to show |1 — < f , it is equivalent to show 

e 1 e 

1 - - < < 1 + - 

6 - l + x - 6 



Or equivalently 



(1-^)(1+^)<1<(1 + ^)(1 + ^) 



The above follows if x < ^. So we need N large enough s.t. ^'^'^.V^ < 1 Using 
the assumption that mi < ^ we get 

2crA,/TOi 2(tA 



6t y/J^b' 
And thus the result follows if 

12crA , . 
< b~ 

Hence picking N2 > 41ogh(^^=^) suffices. 



□ 



4-5. Proof of Lemma 4-4 

Proof. We find Ni, N2, N3, for the 3 parts respectively, and then choose N = 
max(iVi,iV2,^3)- 

1. As shown in section 2, HiYmi) is the sum of mi iid random variables each 

with mean /i and variance . Thus the result follows from applying theorem 
00 

(|2.7I) to the sum ^ H{Xmi) with finite interval [— T, T], getting Mi large enough 

i=l 

S.t. for mi > Ml, for aU t e [-T.T], the fuzzy term 0(mi~) < |. Do the 

same on the sum for HlXmn) to get M2. By the assumption mi,m2 > 

choosing 7V3 = max(^Mi + 1, ^M2 + 1) suffices. 

2. We can ignore the square root, it suffices to show that 

II -^l<^ (12) 
m2 6 
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By assumption 

l/iTOi + k — /im2| < /i 
Divide through by /^TO2 and get 

|i_!!^__L|<^ 

7712 /im2 m2 

This foUows from 

k 1 ^ mi 1 
<1 i< [ 



m2 m2 m2 fJ,m2 
Thus (1121) follows from showing — H — ^ < | . Using the assumption m2 > ^ 

^ ^ 7712 ^Tll2 — D ^ — /i 

and |fc| < fe'^ it follows that 

m2 /iTO2 

Picking 7V4 = max(logb(i^) + 2,41ogf,(J^) + 2) works. 

3. We first find N' s.t. n > iV' =4> i2 e [-^,7]. Starting with the 
assumption that 

/zmi + fc + ticr^mi = /im2 + t2(Ty/rri2 

Rearranging it, using the facts that |/imi + fc — /im2| < and |ti| < ^ this 
implies that 

|t2|< ^ 



<JyJm2 2-^7712 

We assumed that ?Ti2 > j a-lso in part (2) we showed 37V4 s.t. n > 
^ < (1 + 1) < ?• Hence 

^7712 — ^ 6 — 6 



1*21 < 
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Pick iV' > A^4 large enough s.t. n> N' =4> — < This will take care 

of the size of t2 ■ 

Now we must show that 3iV" large enough s.t. n > N" =^ 

|l-e^|<- 

After finding N" we set N5 = ma.x{N',N"). Note that for x e M to show 
1 1 — I < I it suffices to show that 

ln(l -^)<x< ln(l + 
6 6 
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Thus letting e* = min(ln(l + §), | ln(l - f )|) we find N" s.t. n > N" 



< €* 



It was assumed that 



/imi + k + tiay/mi — fim2 + t2(J\Jm2 

Or equivalently 

Applying the assumption that \LliS\ < /i and dividing both sides by a^/rri2 we 
get 

1 / mi , a 

1^2 -ii,/ — I < ^ 

V 1712 Jm2<7 



\t2-tl+h{l 



t2-h < 



"^1)1 < ^ 

m2 ~ \ffn2(y 



V "l2 



Recall by assumption m2 > ^-^^7^, 1*21 < and by the proof of part (2) 



1 - < '"4 . Putting this together it follows that 



/ 2 i 2 

12 — II 




2 





(^)(t2-*l) 



< T 



Now since T, /i, cr, 6 are all constants, lim RHS = so 3N" s.t. n > N" => 
RHS < e*. Finally set N5 = max(7V',lv''). □ 



5. Experimental Data 

The dataH presented in this section is the results of short computer searches, 
so the bounds surely can be improved greatly with more computing time. Float- 
ing point approximation with conservative rounding was used. 

5.1. Finding an Appropriate n-strict Interval 

If n is divisible by 4, and the interval — 1] has type-C density d, 

the there exists an n-strict interval with type-C density >= d, which we may 
apply theorem (j4.ip to. The density of [&"~"'^, &" — 1] for various n can be quickly 
calculated by first computing the densities of intervals of the form [0, 6" — 1], the 
algorithm for which was discussed in section (2.2). After an appropriate n-strict 
interval is found, we check to see that n satisfies the bound (B), compute the 
error term, and find the desired bound. Our results show that in almost all 
cases, the asymptotic density does not exist. 



•^Data generated by fellow graduate student, Patrick Devlin. Also it was double checked 
for smaller n using a slower Maple program written independently by the author. 
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5.2. Explanation of Results 

The following information is given in tables (in the order of column in which 
they appear): 

1. The cycle in which type-C densities are being computed. 

2. The lower bound on the upper density (UD) implied by theorem (I4.ip 

3. The upper bound on the lower density (LD) implied by theorem (14. ip 

4. The n s.t. the interval [b"~^, &" — 1] is used to find the bound (denoted as 
UD n or LD n 

5. The S(n) = ( + part of the error term for Theorem (14.11) 

(we only present an upper bound on |(5(n)|, the true number is always 
negative). In all cases the error is small enough not to affect the bounds 
as we only give precision of about 5 or 6 decimal places. 

5.2.1. Cubing the digits 

In this case if n > 16 then it satisfies bound (B). Table 1 shows the results 
for the cycles when studying the (3, 10)-happy function. There are 9 possible 
cycles. Figure 2 graphs the density of type-{l} integers less than 10". It is easy 
to prove in this case that 3 | n <^=^ n is type-{153}. 



Table 1; Bounds for the Cycles Appearing for Digit Sequence 
0, 1, 8, 27, 64, 125, 216, 343, 512, 729] 



Cycle 


UD 


LD 


UD n 


LD n 


UD S{n) 


LD S{n) 


{1} 


> .028219 


< .0049761 


10864 


10132 


< 10-^"*^ 


< 10-^4 


{55,250,133} 


> .06029 


< .0447701 


10""« 


loyw 


< 10-^4 


< io-^^« 


{136,244} 


> .024909 


< .006398 




10™ 


< 10-^^ 


< 10-^^ 


{153} 


_ 1 


_ 1 


N/A 


N/A 


N/A 


N/A 


{160,217,352} 


> .050917 


< .03184 




104bt> 


< io-^« 


< io-^« 


{370} 


> .19905 


< .16065 




lObriu 


< 10-^^ 


< io-«« 


{371} 


> .30189 


< .288001 


10«^« 


1042U 


< 10-^"^ 


< 10-'^" 


{407} 


> .04532 


< .0314401 


10420 


10«3B 


< 10-^" 


< lo-^'^^ 


{919,1459} 


> .04425 


< .01843 




10i2U 


< 10-"^ 


< 10-^^ 



5.2.2. A More General Function 

In order to emphasize the generality of theorem (j4.1[) . we consider the func- 
tion in base-7 with digit sequence [0, 1, 7,4, 17,9, 13]. There are the following 
two cycles (written in base-10) {1}, {20}, both are fixed points. The percentage 
of numbers arriving at 1 is large in the case, varying from .94 to .98. Figure 
3 graphs the relative density of type-{l} numbers. Table 2 shows the bounds 
derived. As there are only two cycles, we focus on the cycle {1}. In this case, if 
n > 12 then it satisfies bound (B). 
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Figure 2: Relative Density of Type-{1} Integers for Digit Sequence 

[0, 1, 8, 27, 64, 125, 216, 343, 512, 729] 




100 200 300 400 500 600 700 800 900 1000 



Table 2: Bounds for the Cycles Appearing for Digit Sequence [0, 1, 7, 4, 17, 9, 13] 



Cycle 


UD 


LD 


UD n 


LD n 


UD d{n) 


LD 6(n) 


{1} 


> .9858 


< .94222 


7I76 




< 10-^7 


< io-«' 



Figure 3: Relative Density of Type-{1} Integers for Digit Sequence [0, 1, 7, 4, 17, 9, 13] 




1000 2000 3000 4000 



6. Appendix 

Lemma 6.1. Let a > 0, assume f : R+ — > ]R+ has continuous first and second 
derivatives s.t. Vx G R+. f'{x) > and /"(x) < 0. Also we assume lim f{x) = 

00. Furthermore suppose we have x* G K+ s.t. f{x* + 1) < a. Then 3n e N 
s.t. n>x* andO<a- f{n) < f'{x*). 

Proof. This follows from a first order Taylor approximation of the function /, 



20 



for completeness we have included the details. Let x* s.t. f{x* +1) < a be 
given. Set n = sup{m G N|/(m) < a}. Since / is strictly increasing and 
unbounded this n exists. So we have /(n) < a and f{n + 1) > a. Note that 
n > X* as otherwise [cc*] would be the suprimum. By the concavity of / we 
have f{n + 1) - f{n) < f'{n) < f'{x*). But /(n + 1) > a, so we conclude 
0<a-/(n)</'(x*). □ 



Lemma 6.2. Let n be a positive integer, X = bs , and a G [6" ^,6"]- A 



ssume 



that fi, a are the digit mean and variance of some generalized b-happy function 



H . Assume also that n satisfies the bounds in (B). Let f{n) — l + jfin+Xa\/ |n. 



Then 3 an integer n2 s.t. 

• ^ <n2< r^b" 

• 4 I 712 

• < a - /(ns) < 3/i + 1 
Proof. Since we require that 4 | n2 we apply lemma [GHI on the function g{m) 

An 



f{4:m) — l + 3iim + Xa\/3m. Setting x* — we first check that g{x* +1) < a. 
Thus we need 



l + 3Ai(^^ + l) +6^(7^/3(^-^ + 1) <a 
Simplifying the LHS, and using the fact that a > 6"^^, it suffices to show 




3 6"-^ 
l + 3Ai + 6^CTW — +36-« < — — 
46^ 4 

To keep the results of this paper proof as general as possible, we only assumed 
that M ^ i (this would correspond to the quite uninteresting b-happy function 
H which is = on all digits except for 1). Also clearly 6" > 3, therefore 
J- + 36-" < I + 1 < 2. And so 

LHS < 1 + 3^ + V2ab^ 

So the statement follows if 

A{l + 3fi + V2ab^) < 6"-i 
This is exactly the bound (Bl). Thus by lemma [5TT] 3m £ N s.t. 

And 

g {x ) = 3^* + 



4m 
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= 3/i + ^/3|J^ab^ 
Again by assumption (B2) on n, the previous statement is 



< 3/i + l 

Thus setting n2 = 4to we have 4 | n2, n2 > and < a — /(n2) < 3/x + 1. 
We note finally that /(^&") > a, and since / is strictly increasing we conclude 
furthermore that n2 < 4-^^- □ 
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